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Abstract — This paper deals with distributed beamforming 
techniques for wireless networks with half-duplex amplify-and- 
forward relays. Existing schemes optimize the beamforming 
weights based on the assumption that channel state information 
(CSI) is available at the relays. We propose to use adaptive 
beamforming based on deterministic perturbations and limited 
feedback (1-bit) from the destination to the relays in order to 
avoid CSI at the relays. Two scalable perturbation schemes are 
considered and practical implementation aspects are addressed. 
Simulation results confirm that the proposed techniques closely 
approach optimum performance and have satisfactory tracking 
properties in time-varying environments. 

1. Introduction 

A. Background 

Terminal cooperation in wireless networks has been rec- 
ognized as a means to form virtual arrays that can realize 
spatial diversity in a distributed fashion. An important special 
case is distributed beamforming with half-duplex amplify-and- 
forward (AF) relays. The coherent AF scheme in [1] requires 
local channel phase information at the relays to achieve 
coherent phase combining with equal power at all relays. 
Beamforming with non-uniform power allocation (PA) under a 
sum power constraint [2], [3] and under individual relay power 
constraints [4] offers significant performance gains. However, 
optimal beamforming with PA places strong requirements 
regarding channel state information (CSI) at the relays. For 
centralized arrays with co-located antennas, this requirement 
has been circumvented by adaptive gradient beamforming 
techniques that iteratively adjust the beamforming weights us- 
ing stochastic vector perturbations and limited feedback from 
the destination [5]. A related approach based on deterministic 
perturbations is presented in [6]. In a similar spirit, feedback- 
assisted distributed beamforming with phase perturbation in 
wireless networks was considered in [7] and extended to the 
multiuser context in [8]. However, both methods do not assume 
a relay setup and do not address distributed PA. 

B. Contribution and Organization of Paper 

We consider perturbation-based beamforming (PB-BF) with 
1-bit feedback in a relay network. Under the assumption of a 
sum power constraint, the relays use the feedback bit to adapt 
their beamforming weights in order to maximize either the 
signal-to-noise ratio (SNR) or the received signal power at the 
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destination. This approach does not require any CSI at the re- 
lays. Two different perturbation schemes are investigated, both 
of which are based on deterministic perturbation sets to avoid 
extensive signaling/feedback overhead. Within this context, we 
present a scalable protocol, discuss implementation aspects, 
and provide numerical performance comparisons. Simulation 
results corroborate that our approach can satisfactorily track 
time-varying channels in non-static environments. We note that 
in the context of wireless ad-hoc networks a related idea was 
touched upon in [9] without explicitly addressing the important 
practical problem of weight exchange. 

The rest of the paper is organized as follows. Section 
mi introduces the system model and Section [III] proposes 
perturbation-based distributed beamforming with 1-bit feed- 
back. A comparison with optimum batch solutions is provided 
in Section |lVl Section [V] discusses simulation results and 
conclusions are provided in Section IVII 

II. System Model 

We consider a perfectly synchronized wireless network with 
single antenna nodes where a single source S communicates 
with a single destination V via R half-duplex relays TZi, i = 
1, . . . , i? (cf. Fig.[T]i. The half-duplex constraint necessitates a 
two-hop protocol. In the first hop, S transmits the signal y/P^s 
to the relays which receive 



= V PghiS + Wi, i ^ 1, . 



,R. 



(1) 



Here, s is the transmit symbol normalized as E{|sp} = 1 (E{-} 
denotes expectation), Ps denotes the average transmit power of 
S, hi is the complex coefficient of the flat fading "backward" 
channel between S and TZi, and Wi ~ CAf{0, Nq) denotes 
i.i.d. complex Gaussian noise. In the AF scenario considered, 
the second hop amounts to each relay transmitting a complex 
scaled version of the signal it has received, i.e.. 



a* Xi Xi , with \i = 



P 



' R|/i,:|2 + iVn 



(2) 



Here, complex conjugation (superscript *) of the beamform- 
ing weights ai will simplify notation later on, and \i is 
a power normalization factor such that the average relay 
power is E{|rip|/ii} = jofipP. The destination receives 
y — X^iLi gifi + v, where gi denotes the complex coefficient of 
the "forward" channel between TZi and T>, and v^CJ\f{Q, Nq) 

'Note that our discussion does not presume specific channel statistics. 
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Fig. 1. Wireless relay network with feedback. 

is complex Gaussian noise. Inserting ([U and ^ yields the 
compound channel modej^ 



Here, h = 
diag(gi,... 
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with /li = higiX^ 
hXi, and w= [tui . . . 
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the weight vector a = [ai . . . enters also the noise part in 
it demands careful design to prevent noise amplification. 
From (|3]l, the average power corresponding to the signal 
part of y and the SNR at V are respectively obtained as 

|2 
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In the following, we will use 7(0;) as generic notation for our 
objective function, which can either be Pp(q;) or p{a). The 
beamforming vector a can be batch designed to maximize 
7(a) subject to a specific relay power constraint. We resort 
to two types of power constraints: Constraining the complex 
beamforming weights to jaip = 1 (this amounts essentially 
to phase-matching at the relays [7]) ensures identical per- 
relay power E{|rip|/ii}=:P. In contrast, the total sum power 
constraint E{ X^ili kiPl^ii • • • 1 ^-r} = P requires that the 
beamforming vector has unit Euclidean norm, ||q;||^ = 1. 
However, such batch designs entail stringent requirements 
regarding the CSI available to the relays (cf. Section ITVb . 

III. Adaptive Perturbation-based Beamforming 
A. Transmission Principle 

To avoid CSI at the relays, we study distributed beamform- 
ing using feedback-assisted adaptive weight perturbation. The 
idea underlying this approach is to maximize the objective 
function 7(0;) by adjusting the beamforming weights at the re- 
lays in an iterative manner using limited feedback (see Fig.[T|). 
For co-located arrays with centraUzed processing similar ideas 
have been proposed in [5], [6]. 

Transmission happens in frames consisting of a training 
interval Tp and a data interval The relays use different 
beamforming weights to forward the training and data parts of 
each frame received from 5 to P according to (O. The idea 

^Superscript (^) denotes (Hermitian) transposition; diag(xi, . . . , Xm) 
is the m X m diagonal matrix with diagonal elements xi, . . . , Xm- 



is to apply the currently best beamforming vector, denoted 
OLk (k is the frame index), to the data while using a perturbed 
version a.k of the beamforming vector for the training portion. 
The destination evaluates the effectiveness of the perturbed 
weights and checks whether or not the perturbation improved 
the objective function 7(0;^). It then provides the relays with 
one bit of feedback to indicate which beamforming vector shall 
be used to forward the data of the next frame. 

In the proposed scheme, the weights ctfc are obtained by 
an additive perturbation from the data beamforming vector 
OLk- While in a centralized setup the perturbation vectors can 
be chosen randomly for each frame, our distributed setup 
necessitates a deterministic vector set, collected in a i? x 
matrix Q = [qo . . . Qtv-i], from which the perturbation vector 
is picked in a cyclic fashion (cf. Section Ull-Db . 

In the following, we present two variants of the proposed 
PB-BF scheme and describe the individual steps in more detail. 
For this discussion, we assume that all channel coefficients 
remain constant during the weight adaptation process. 

B. Take/Reject (T/R) Perturbation 

The perturbed weights for the kth frame are computed as 

a'k = Oik + Hq.kmodN, (6) 

where ^ is a step-size parameter determining the adaptation 
rate, followed by proper normalization, i.e., ct^ = q;'j./||q;J,| 
in case of a sum power constraint and ak,i — ct'/, ^l\oi!^ J for 
a per-relay power constraint. The weights otk are applied to 
the training sequence received at the relays, which is then 
forwarded to P. At V, the known training sequence and the 
receive signal are used to evaluate the performance of cife 
within 7^ according to the objective function, i.e., 7^=7(0;^). 
Recall that 7(-) represents either the received signal power 
Pt){-) in (HI or the SNR p( ) in The actual estimation of 
these quantities will be addressed in Section IIII-EI 

The destination then compares 7^ to the performance 7^ = 
7(afe) achieved with the beamforming vectors otk that up to 
this point performed best. If 7^ < 7fc, a.^ does not perform 
better than ol^ and hence the relays should stick with a.^ 
for the data in the next transmission frame ("reject" 0.^). 
Otherwise (7^ > 7^), the beamforming vector a.^ improves on 
afc and should thus be used in the next frame to transmit the 
data ("take" otk). This rationale can be implemented be letting 
V provide the relays with a single bit of feedback, given by 

Cfc = u(7fc-7fc), 

where u(.) denotes the unit step function. Depending on the 
feedback bit, the relays update the data beamforming vector 
for the next frame as 



OLk+\ 



OLk, if Cfc = 0, 

OLk, if Cfc = 1. 



The vector OLk+\ will be the basis for the next perturbation 
according to (|6]l. The destination 2? performs the correspond- 
ing update 7fc+i = max{7fc,7fc}. This process continues in 
an iterative manner. During the first frame, the scheme is 



initialized with ao = ctQ (by setting, e.g., ~ [1 
in case of a sum power constraint) and 70 ~ 0. 

C. Plus/Minus (P/M) Perturbation 

T/R perturbation has the advantage that performance never 
deteriorates, i.e., 7(Q:fc_|-i) > '^{cx.k). On the other hand, in 
many cases the perturbation (|6j will not yield an improvement, 
which entails ccfc+i — olu and hence slow adaptation. We 
next discuss an alternative perturbation scheme with faster 
adaptation rate. Here, the training interval 7J, is split into two 
halves 7p+ and for which different perturbed beamforming 
vectors are used, i.e.Jl 

\\ak+^J■(\kmoAN\y ^ \\a.k-^J'^.kmodN\\' 

The destination V then measures the performance of and 
q;j7 by evaluating the objective function according to 7^!" ~ 
7(0:^) and 7^ = ^{oL^) within 7p+ and , respectively. 
While in principle we could pick the beamforming weights 
corresponding to the maximum of 7^*^, 7^7, and 7fc = 7(cKfc) 
(the performance of the current data beamforming vector), 
1-bit feedback can only support binary choices. Hence, a.k 
will be discarded in any case. T) broadcasts the feedback bit 
Cfc = u(7j7— 7j!') to the relays, indicating whether the "plus" 
perturbation or the "minus" perturbation performs 
better. In the next frame, the relays use the beamforming vector 
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OLk+l 



if Cfc 
if Cfc 



Although P/M perturbation shows typically faster adaptation 
than T/R, sometimes both perturbations in (|7|i deteriorate the 
performance with respect to 7^. Thus, P/M performance may 
fluctuate continually. Furthermore, only half of the training 
interval can be used to estimate each of 7^^ and 7^. 

D. Perturbation Set 

Vector normalization of the weights in (|6]l and (|7]i ensures 
that the sum power constraint is satisfied, but requires that each 
relay knows all elements of the beamforming vector. Hence, a 
stochastic gradient algorithm with random perturbation vectors 
(as in [5]) cannot be applied to relay networks; this would 
require to exchange all weights among the relays, thus impos- 
ing a tremendous signaling overhead. Rather, we propose to 
use a matrix Q of deterministic perturbation vectors (cf. [6]) 
known to each relay. This allows each relay to keep track of 
all beamforming weights and to perform vector normalization 
locally. Reasonable choices for the deterministic perturbation 
matrix are Q [F,jF] (i.e., N = 2R vectors) for P/M 
perturbation and Q = [F, jF, -F, -.7F] (N = AR) for T/R 
perturbation; here, F is an i?x_R unitary matrix. We observed 
that choosing F as discrete Fourier transform (DFT) matrix 
provides good performance under various conditions. 

Note that with the per-relay power constraint, element-wise 
normalization does not require knowledge of all weights at 
each relay, thus allowing also for stochastic perturbations. 

^These expressions are valid for the sum power constraint. With a per-relay 
power constraint, weight normalization has to be peiformed element-wise. 



We next discuss the estimation of the receive signal power 
P-nia) in dU and the SNR p{a.) in (|5]l which are used 
as performance measures, as well as the estimation of the 
compound channel ^ in (l3]l required for coherent detection. 

In the following, we omit the frame index k and denote the 
pilot sequence within a transmission frame as Sp[n], n£T'. 
The destination can then compute the maximum likelihood 
(ML) estimate of the compound channel as 



(8) 



EnGT' kpMP ' 

Using ([8]), the ML estimates of receive signal power and SNR 
can be obtained as 



R 



V 



P 



|T'| E 
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For T/R perturbation, (O and (|9]l are evaluated using T' = 
Tp. After each weight update the destination stores the channel 
estimate and uses it for data detection of the subsequent frames 
till the next update occurs. With P/M, ^ and (|9]l are calculated 
twice in each frame with T' = 7p+ and T' = . The channel 
estimate corresponding to the better beamforming vector is 
then kept for data detection in the next frame. Alternatively, 
an approximate ML estimate for the channel coefficient ^ can 
be obtained by evaluating ^ over the whole training interval 
(T' =7^) within the same frame (cf. [5]), provided that the 
step-size ji is chosen sufficiently small and \T^\ 



IV 



F. Birth and Death of Relays 

Our deterministic perturbation approach is scalable in that 
it can be easily adapted to deal with the situation where relays 
enter ("birth") or leave ("death") the network, even in the 
case of a sum power constraint. We assume that the maximum 
number of relays is i?,nax, of which R < i?max are active and 
can exchange information with 1) but not with each other In 
essence, the destination and each relay keep track of all the 
active relays. The relays can then compute the required vector 
norm locally. Additionally, all relays know their "identity" 
(index i), which is fixed and enables them to pick their 
corresponding beamforming weight. If a relay TZig drops out, 
it informs I? which in turn broadcasts the relay index io to 
the remaining relays using log2(i?max) bits. These relays then 
exclude the corresponding beamforming/perturbation weight 
from the update process. If a new relay enters the system, it 
contacts T) which in turn broadcasts i?max bits to indicate to 
all relays (also to the new one) which relays are active. Since 
the new relay cannot know the current beamforming weights 
of the other relays, the weight adaptation process needs to be 
re-initialized in this case. 

In the case of a per-relay power constraint, element-wise 
weight normalization allows that the relays only need to track 
their own weights. This renders a birth-and-death protocol 
particularly easy, since relays can enter or leave the system 
completely arbitrarily without informing the other relays. 



IV. Comparison with Optimal Beamforming 

We next compare optimal batch beamforming designs with 
adaptive PB-BF. The former requires each relay having either 
local CSI (i.e., each relay's own back- and forward channel) 
or global CSI (i.e., all channels) available, whereas PB-BF 
exploits limited feedback to avoid CSI at the relays. 

A. Optimal Batch Designs 

Equal Gain Combining (EGC). Maximizing Pviot) or p{cx.) 
under a per-relay power constraint yields the beamforming 
weights Oi — hi/\hi\ = higi/\higi\ that amount to coherent 
combining [1]. This scheme requires that each relay knows 
the phase of its backward and forward channel. 

Power Maximization under Sum Power Constraint (P-SP). 
Optimizing Pt>{oi.) in (|4|i under a sum power constraint 
amounts to maximizing |a^h| subject to ||q!|P = 1. Via 
the Cauchy-Schwarz inequality, the solution is obtained as 
cti = /ii/||h|| requiring global CSI at TZi. Alternatively, if 
global CSI is available at V, each relay needs only local 
CSI and feedback of j|h|| from V. This shows that the relays 
optimally allocate their transmit power to match the current 
local fading coefficients while performing coherent combining. 

SNR Maximization under Sum Power Constraint ( S-SP). The 
beamforming vector a can also be chosen to maximize the 
SNR p{cx.) in (|5]l [2], [3]. Under the sum power constraint 
||q;|P = 1 this can be shown to lead to a generalized eigenvalue 
problem whose solution is (I is the identity matrix) [3] 

(I + GGg)-ih 

" = ii(i+GG-)-hir ^''^ 

Again this essentially requires either global CSI at the relays 
or local CSI with feedback of ||(I + GG^)-ih|| from V. In 
contrast to P-SP, (fTOl i also accounts for noise amplification. 

B. Comparison with PB-BF Schemes 

It can be shown that p(a) and Pv{cy.) have only a global 
maximum (unique up to phase ambiguity) under both power 
constraints and this maximum is achieved by the correspond- 
ing optimal batch design. The proposed PB-BF schemes aim to 
maximize p{a.) or Px>{ct), and indeed approach their optimal 
counterparts (cf. Section[V]i. EGC can be approximated by PB- 
BF using element-wise normalization and the objective func- 
tion 7(0:) chosen as received signal power Px>{a). P-SP and 
S-SP performance can be approached using 7(a) = Pv{ot) 
and 7(0:) = p{a) as objective function, respectively, and 
vector normalization of the beamforming weights. The PB-BF 
schemes can be implemented via T/R or P/M perturbation. 

V. Simulation Results 

We next investigate a network with i? = 3 relays via 
numerical simulations; we will refer to the PB-BF schemes by 
adding the prefix 'PB-' to the corresponding batch design. For 
a fair comparison, all schemes use the same total relay power 
P (in (|2]i we thus have P — P/R under a per-relay power 
constraint and P — P under a sum power constraint). The 
source S transmits BPSK symbols with transmit power Ps=P 



and the destination V employs an ML detector. We further 
assume error- and delay-free 1-bit feedback, and employ a 
deterministic perturbation set based on a 3 x 3 DFT matrix. 

A. Idealized Scenario 

In this scenario, all channels are static i.i.d. Rayleigh fading 
with different path loss, i.e., hi,gi ^ CJ\f{0,d~^) with di = 
1, 3, 5. Each relay perfectly knows its backward channel (used 
in dU), and V has perfect knowledge of the compound channel 
^ and the performance measures Pv{<y.) and p{a). Unless 
stated otherwise, P/A'^o = 18dB. 

Convergence Behavior For the case of PB-S-SP using P/M 
and T/R perturbation with step size p ~ 0.1 and p = 0.3, 
Fig. 12 a) shows the evolution of the receive SNR pk^p{ak) 
(normalized by the maximum receive SNR) versus the frame 
index k for one channel realization. It is seen that with T/R pk 
is nondecreasing and reaches almost optimal performance; a 
larger step size results in faster convergence but also in a larger 
gap to the optimum. Similar observations apply to P/M, which 
converges significantly faster than T/R, but features continual 
fluctuations whose amplitude increases with the step size. 

For a systematic assessment of the convergence rate of PB- 
S-SP (with p~0.1), Fig.|2b) shows the empirical cumulative 
distribution function (cdf) of the normalized SNR gap that 
remains after a certain number of frames (shown as curve 
labels). The cdfs were obtained with 10^ fading realizations. 
P/M converges considerably faster than T/R. To achieve an 
SNR gap of less than 4.3% in 91% of the cases, P/M and T/R 
respectively require 40 and 70 iterations. However, after a large 
number of frames, T/R on average features a considerably 
smaller SNR gap than P/M. Our simulations also revealed that 
a larger number of relays leads to slower convergence; for 
space reasons, the corresponding curves cannot be shown here. 

BER Performance. Fig.|2c) plots bit-error rate (BER) versus 
nominal SNR P/Nq (in dB) for the batch designed beamform- 
ing schemes EGC, P-SP, S-SP, and their perturbation-based 
counterparts. In each simulation run, only the frames after 
convergence of the PB-BF schemes were taken into account 
for the BER evaluation (again p = 0.1). As a reference, we 
include an AF scheme that uses uniform PA and no coherent 
combining (labeled 'no BE'). 

It can be seen that all PB-BF performance curves are almost 
indistinguishable from those of their corresponding batch de- 
signs and offer significant gains over the no-BF case (e.g., 8 dB 
SNR improvement at a BER of 10"^). SNR optimization (PB- 
S-SP) is seen to outperform power optimization (PB-P-SP) at 
high SNR. In fact, PB-S-SP and S-SP are the only schemes to 
achieve a diversity larger than 1. Power optimization under a 
sum power constraint (PB-P-SP) and under a per-relay power 
constraint (PB-EGC) perform almost identically; in fact, PB- 
P-SP appears to suffer from noise amplification at high SNR. 

B. Realistic Scenario 

We next use independent, time-varying flat fading channels 
with Jakes Doppler profile and the same path loss model as in 
Section IV-AI Furthermore, the destination uses (O and (|9]l to 




Fig. 2. PB-BF performance for idealized scenario witli static cliannels in a 3-relay network: (a) example for SNR evolution at © (P/Nq = 18 dB), (b) cdf 
of the SNR gap after a fixed number of frames (P/Nq = 18 dB, fj, = 0.1), and (c) BER versus nominal SNR (p = 0.1). 



estimate the compound channel, the received signal power, and 
the instantaneous SNR. To this end, each transmission frame 
contains |7J,| = 10 pilot symbols in addition to |7d| =40 data 
symbols. The normalization in (|2]i is achieved by measuring 
the receive power at the relays during one frame. 

We analyze the tracking capabilities of P/M perturbation 
(with /i = 0.1 and /i = 0.5) in terms of BER versus normalized 
Doppler frequency (i.e., Doppler in Hertz times frame length 
in seconds) for P/A'o = 22dB (see Fig.O. In general, the BER 
degrades with increasing Doppler At high Doppler frequencies 
the relay weights cannot be adapted fast enough to the channel 
variations (note that in practice, the feedback delay will add on 
top of this). Moreover, if there are channel variations within 
a frame, the compound channel and the objective function 
cannot be estimated accurately. Even at low Doppler, there is 
an order of magnitude BER penalty for PB-S-SP (cf. Fig. [3] 
with n^O.l and Fig. Etc) at P/iVo = 22dB). 

We observe that at low Doppler frequencies a small step 
size (/i = 0.1) performs better whereas at higher Doppler 
frequencies a larger step size (/i = 0.5) is advantageous since 
it allows quicker adjustment of the relay weights. Note that 
with /i = 0.1, PB-S-SP looses its entire performance advantage 
over PB-P-SP at high Doppler frequencies. 

In time-varying scenarios, T/R suffers from the fact that 
the beamforming weights are not updated when the channel 
quaUty gets worse. This can be circumvented by building a 
forgetting factor into the performance measure 7^. 

VI. Conclusion 

We have investigated scalable perturbation-based distributed 
beamforming protocols in wireless relay networks that exploit 
1-bit feedback to approach the optimal beamforming weights 
in an adaptive manner while avoiding CSI at the relay nodes. 
We used a deterministic perturbation set to optimize either re- 
ceived signal power or SNR at the destination under per-relay 
or sum power constraints. The best performance was observed 
with SNR as objective function under a sum power constraint. 
At high SNR, equal gain combining appears to be preferable 
over power optimization under a sum-power constraint. In 
time-varying environments, the proposed perturbation schemes 
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Fig. 3. BER versus normalized Doppler frequency for various PB-BF 
schemes using P/M perturbation under a realistic scenario (P/Ng =22dB). 

require a careful choice of the step-size parameter and the 
transmission frame length. 
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